Recombinant DNA expression vectors

ABSTRACT

The invention provides expression vectors containing the promoter, enhancer and substantially complete 5&#39;-untranslated region including the first intron of the major immediate early gene of human cytomegalovirus. Further vectors including the hCMV-MIE DNA linked directly to the coding sequence of a heterologous gene are described, Host cells transfected with the vectors and a process for producing heterologous polypeptides using the vectors and the use of the hCMV-MIE DNA for expression of a heterologous gene are also included within the invention.

This application is a continuation application Ser. No. 07/896,797, filed Jun. 9, 1992, now abandoned, which is a continuation of application Ser. No. 07/339,615, filed Apr. 28, 1989, now abandoned.

FIELD OF THE INVENTION

This invention relates to expression vectors containing a DNA sequence from the human cytomegalovirus major immediate early gene, to host cells containing such vectors, to a method of producing a desired polypeptide by using vectors containing said sequence and to the use of said DNA sequence.

BACKGROUND TO THE INVENTION

The main aim of workers in the field of recombinant DNA technology is to achieve as high a level of production as possible of a particular polypeptide. This is particularly true of commercial organisations who wish to exploit the use of recombinant DNA technology to produce polypeptides which naturally are not very abundant.

Generally the application of DNA technology involves the cloning of a gene encoding the desired polypeptide, placing the cloned gene in a suitable expression vector, transferring a host cell line with the vector, and culturing the transferred cell line to produce the polypeptide. It is almost impossible to predict whether any particular vector or cell line or combination thereof will lead to a useful level of production.

In general, the factors which significantly affect the amount of polypeptide produced by a transferred cell line are: 1. gene copy number, 2. efficiency with which the gene is transcribed and the mRNA translated, 3. the stability of the mRNA and 4. the efficiency of secretion of the protein.

The majority of-work directed at increasing expression levels of recombinant polypeptides has focussed on improving transcription initiation mechanisms. As a result the factors affecting efficient translation are much less well understood and defined, and generally it is not possible to predict whether any particular DNA sequences will be of use in obtaining efficient translation.

Attempts at investigating translation have consisted largely of varying the DNA sequence around the consensus translation start signal to determine what effect this has on translation initiation (Kozak H. Cell 41 283-292 (1986)).

Studies involving expression of desired heterologous genes normally use both the coding sequence and at least part of the 5'-untranslated sequence of the heterologous gene such that translation initiation is from the natural sequence of the gene. This approach has been found to be unreliable probably as a result of the `hybrid nature` of the 5'-untranslated region and the fact that the presence of particular 5-untranslated sequences can lead to poor initiation of translation (Kozak H. Procl. Natl. Acad. Sci. 83 2850-2854 (1986) and Pelletier and Sonenberg Cell 40 515-526 (1985)). This variation in translation has a detrimental effect on the amount of the product produced.

Previous studies (Boshart et al Cell 41 521-530 (1985) and Pasleau et al, Gene 38 227-232 (1985); Stenberg et al, J. Virol 49 (1) 190-199 (1984); Thomsen et al Proc. Natl. Acad. Sci. USA 81 659-663 (1984) and Foecking and Horstetter Gene 65 101-105 (1986)) have used sequences from the upstream region of the hCMV-MIE gene in expression vectors. These have, however, solely been concerned with the use of the sequences as promoters and/or enhancers. Spaeta and Mocarski (J. Virol 56 (1) 135-143, 1985) have used a PstI to PstI fragment of the hCMV-MIE gene encompassing the promoter, enhancer and part of the 5'-untranslated region, as a promoter for expression of heterologous genes. In order to obtain translation the natural 5'-untranslated region of the heterologous gene was used.

In published European Patent Application No. 260148, a method for the continuous production of a heterologous protein is described. The expression vectors constructed contain part of the 5'-untranslated region of the hCMV-MIE gene as a stabilising sequence. The stabilising sequence is placed in the 5'-untranslated region of the gene encoding the desired heterologous protein i.e. the teaching is again that the natural 5'-untranslated region of the gene is essential for translation.

SUMMARY OF THE INVENTION

In a first aspect the invention provides a vector containing a DNA sequence comprising the promoter, enhancer and substantially complete 5'-untranslated region including the first intron of the major immediate early gene of human cytomegalovirus.

In a preferred embodiment of the first aspect of the invention, the vector includes a restriction site for insertion of a heterologous gene.

The present invention is based on the discovery that vectors containing a DNA sequence comprising the promoter, enhancer and complete 5'-untranslated region of the major immediate early gene of the human cytomegalovirus (hCMV-MIE) upstream of a heterologous gene result in high level expression of the heterologous gene product. In particular, we have unexpectedly found that when the hCMV-MIE derived DNA is linked directly to the coding sequence of the heterologous gene high levels of mRNA translation are achieved. This efficient translation of mRNA is achieved consistently and appears to be independent of the particular heterologous gene being expressed.

In a second aspect the invention provides a vector containing a DNA sequence comprising the promoter, enhancer and substantially complete 5'-untranslated region including the first intron of the major immediate early gene of human cytomegalovirus upstream of a heterologous gene.

The hCMV-MIE derived DNA according to the second aspect of the invention may be separated from the coding sequence of the heterologous gene by intervening DNA such as for example by the 5'-untranslated region of the heterologous gene. Advantageously the hCMV-MIE derived DNA may be linked directly to the coding sequence of the heterologous gene.

In a preferred embodiment of the second aspect of the invention, the invention provides a vector containing a DNA sequence comprising the promoter, enhancer and substantially complete 5'-untranslated region including the first intron of the hCMV-MIE gene linked directly to the DNA coding sequence of the heterologous gene.

Preferably the hCMV-MIE derived sequence includes a sequence identical to the natural hCMV-MIE translation initiation signal. It may however be necessary or convenient to modify the natural translation initiation signal to facilitate linking the coding sequence of the desired polypeptide to the hCMV-MIE sequence, i.e. by introducing a convenient restriction enzyme recognition site. For example the translation initiation site may advantageously be modified to provide an NcoI recognition site.

The heterologous gene may be a gene coding for any eukaryotic polypeptide such as for example a mammalian polypeptide such as an enzyme, e.g. chymosin or gastric lipase; an enzyme inhibitor, e.g. tissue inhibitor of metalloproteinase (TIMP); a hormone, e.g. growth hormone; a lymphokine, e.g. an interferon; a plasminogen activator, e.g. tissue plasminogen activator (tPA) or prourokinase; or a natural, modified or chimeric immunoglobulin or a fragment thereof including chimeric immunoglobulins having dual activity such as antibody-enzyme or antibody-toxin chimeras.

According to a third aspect of the invention there is provided host cells transfected with vectors according to the first or second aspect of the invention.

The host cell may be any eukaryotic cell such as for example plant, or insect cells but is preferably a mammalian cell such as for example CHO cells or cells of myeloid origin e.g. myeloma or hybridoma cells.

In a fourth aspect the invention provides a process for the production of a heterologous polypeptide by culturing a transfected cell according to the third aspect of the invention.

In a fifth aspect the invention provides the use of a DNA sequence comprising the promoter, enhancer and substantially complete 5'-untranslated region including the first intron of the hCMV-MIE gene for expression a heterologous gene.

In a preferred embodiment of the fifth aspect of the invention the hCMV-MIE derived DNA sequence is linked directly to the DNA coding sequence of the heterologous gene.

Also included within the scope of the invention are plasmids pCMGS, pHT.1 and pEE6hCMV.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is now described, by way of example only, with reference to the accompanying drawings in which

FIG. 1: shows a diagrammatic representation of plasmid pSVLGS.1

FIG. 2: shows a diagrammatic representation of plasmid pHT.1

FIG. 3: shows a diagrammatic representation of plasmid pCMGS

FIG. 4: shows the complete sequence of the promoter-enhancer hCMV-MIE including both the first intron and a modified translation `start` site

FIG. 5: shows a diagrammatic representation of plasmid pEE6.hCMV

DETAILED DESCRIPTION OF THE EMBODIMENTS Example 1

The Pst-lm fragment of hCMV (Boshart et al Cell 41 521-530 (1985) Spaeta & Mocarski J. Virol 56 (1) 135-143 (1985)) contains the promoter-enhancer and most of the 5'-untranslated leader of the MIE gene including the first intron. The remainder of the 5'untranslated sequence can be recreated by attaching a small additional sequence of approximately 20 base pairs.

Many eukaryotic genes contain an Ncol restriction site (5'-CCATGG-3') overlapping the translation start site, since this sequence frequently forms part of a preferred translation initiation signal 5'ACCATGPu-3'. The hCMV-MIE gene does not have an Ncol site at the beginning of the protein coding sequence but a single base-pair alteration causes the sequence both to resemble more closely the "Kozak" concensus initiation signal and introduces an Ncol recognition site. Therefore a pair of complementary oligonucleotides were synthesised of the sequence: ##STR1## which when fused to the Pst-lm fragment of hCMV will recreate the complete 5'-untranslated sequence of the MIE gene with the single alteration of a G to a C at position -1 relative to the translation initiation codon.

This synthetic DNA fragment was introduced between the hCMV Pst-lm promoter-enhancer leader fragment and a glutamine synthetase (GS) coding sequence by ligation of the Pst-lm fragment and the synthetic oligomer with Ncol digested pSV2.GS to generate a new plasmid. pCMGS (The production of pSV2.GS is described in published International Patent Application No. WO 8704462). pCMGS is shown in FIG. 3. pCMGS thus contains a hybrid transcription unit consisting of the following: the synthetic oligomer described above upstream of the hCMV-MIE promoter-enhancer (where it serves merely as a convenient Pstl - Ncol "adaptor"), the hCMV-MIE promoter and the complete 5' untranslated region of the MIE gene, including the first intron, fused directly to the GS coding sequence at the translation initiation site.

pCMGS was introduced into CHO-KI cells by calcium phosphate mediated transfection and the plasmid was tested for the ability to confer resistance to the GS-inhibitor methionine sulphoximine (HSX). The results of a comparison with pSV2.GS are shown in Table 1.

It is clear that pCMGS can confer resistance to 20 μM MSX at a similar frequency to pSV2.GS, demonstrating that active GS enzyme is indeed expressed in this vector.

                  TABLE 1                                                          ______________________________________                                         Results of transfection of GS-expression                                       vectors into CHO--KI cells                                                                 no. colonies/10.sup.6 cells                                        Vector      resistant to 20 μM MSX                                          ______________________________________                                         pSV2.GS     32                                                                 pCMGS       17                                                                 Control      0                                                                 ______________________________________                                    

Example 2

The TIMP cDNA and SV40 polyadenylation signal as used in pTIMP 1 Docherty et al (1985) Nature 318, 66-69, was inserted into pEE6 between the unique HindIII and BamHI sites to create pEE6TIMP. pEE6 is a bacterial vector from which sequences inhibitory to replication in mammalian cells have been removed. It contains the XmnI to BclI portion of pCT54 (Emtage et al 1983 Proc. Natl. Aced. Sci. USA 80, 3671-3675) with a pSP64 (Melton et al 1984: Nucleic Acids. Res. 12, 7035) polylinker inserted in between the HindIII and EcoRI sites. The BamHI and SalI sites have been removed from the polylinker by digestion, filling in with Klenow enzyme and religation. The BclI to BamHI fragment is a 237 bp SV40 early polyadenylation signal (SV40 2770 to 2533). The BamHI to the BglI fragment is derived from pBR328 (375 to 2422) with an additional deletion between the SalI and the AvaI sites (651 to 1425) following the addition of a SalI linker to the AvaI site. The sequence from the BglI to the XmnI site originates from the β-lactamase gene of pSP64.

The 2129 base-pair Ncol fragment containing the hCMV MIE promoter-enhancer and 5' untranslated sequence was isolated from pCMGS by partial Ncol digestion and inserted at the Ncol site overlapping the translation initiation signal of TIMP in pEE6.TIMP to generate the plasmid pMT.1 (shown in FIG. 2).

A GS gene was introduced into pMT.1 to allow selection of permanent cell lines by introducing the 5.5K Pvul - BamHl fragment of pSVLGS.1 (FIG. 1) at the BamHl site of pHT.1 after addition of a synthetic BamHl linker to Pvul digested pSVLGS.1 to form pHT.1GS. In this plasmid the hCMV-TIMP and GS transcription units transcribe in the same orientation.

pHT.1 GS was introduced into CHO-Kl cells by calcium-phosphate mediated transfection and clones resistant to 20 μM MSX were isolated 2-3 weeks post-transfection. TIMP secretion rates were determined by testing culture supernatants in a specific two site ELISA, based on a sheep anti TIMP polyclonal antibody as a capture antibody, a mouse TIMP monoclonal as the recognition antibody, binding of the monoclonal being revealed using a sheep anti mouse IgG peroxidase conjugate. Purified natural TIMP was used as a standard for calibration of the assay and all curves were linear in the range of 2-20 ng ml⁻¹. No non-specific reaction was detectable in CHO-cell conditioned culture media.

One cell line GS.19 was subsequently recloned, and a sub-clone GS 19-12 secretes TIMP at a very high level of 3×10⁸ molecules/cell/day. Total genomic DNA extracted from this cell line was hybridised with a TIMP probe by Southern blot analysis using standard techniques and shown to contain a single intact copy of the TIMP transcription unit per cell (as well as two re-arranged plasmid bands). This cell line was selected for resistance to higher levels of MSX and in the first selection a pool of cells resistant to 500 μM MSX was isolated and recloned. The clone GS-19.6(500)14 secretes 3×10⁹ molecules TIMP/cell/day. The vector copy-number in this cell line is approx. 20-30 copies/cell. Subsequent rounds of selection for further gene amplification did not led to increased TIMP secretion.

Thus it appears that the hCMV-TIMP transcription unit from pHT.1 can be very efficiently expressed in CHO-KI cells at approximately a single copy per cell and a single round of gene amplification leads to secretion rates which are maximal using current methods.

Example 3

In order to test whether the hCMV-MIE promoter-enhancer-leader can be used to direct the efficient expression of other protein sequences, two different but related plasminogen activator coding sequences (designated PA-1 and PA-2) were introduced into CHO-KI cells in vectors in which the protein coding sequences were fused directly to the hCMV sequence.

In both these cases, there is no Ncol site at the beginning of the translated sequence and so synthetic oligonucleotides were used to recreate the authentic coding sequence from suitable restriction sites within the translated region. The sequence of the modified hCMV translation-initiation signal as used in pHT.1 was also built into the synthetic oligonucleotide which then ended in a Pst-1 restriction site. The Pst-lm fragment of hCMV was then inserted at this site to create the complete promoter-enhancer-leader sequence.

The hCMV-plasminogen activator transcription units were introduced into CHO-KI cells after inserting a GS gene at the unique BamHl site as above end MSX resistant cell lines secreting plasminogen activator were isolated.

The secretion rates of the best initial transfectant cell lines in each case are given in Table 2. From this it is clear that the hCMV promoter-enhancer leader can also be used to direct the efficient expression of these two plasminogen activator proteins.

                  TABLE 2                                                          ______________________________________                                         Secretion rates of the different plasminogen activator proteins                from transfectant CHO cell lines.                                              Plasminogen activator                                                                         Molecules secreted/cell/day                                     ______________________________________                                         PA-1           5.5 × 10.sup.7                                            PA-2           1.1 × 10.sup.8                                            ______________________________________                                    

Example 4

pEE6hCMV was made by ligating the Pst-lm fragment of hCMV, HindIII-digested pEE6 and the complementary oligonucleotides of the sequence: ##STR2##

cDNA encoding an immunoglobulin light-chain was inserted at the EcoRI site of pEE6.hCMV such that the hCMV-MIE promoter-enhancer leader could direct expression of the cDNA and a selectable marker gene containing the SV40 origin of replication was inserted at the BamHI site of each plasmid.

This plasmid was transfected into COS-1 monkey kidney cells by a standard DEAE-dextran transfection procedure and transient expression was monitored 72 hours post transfection. Light chain was secreted into the medium at at least 100 ng/ml indicating that light chain can indeed be expressed from a transcription unit containing the entire hCMV-MIE 5'-untranslated sequence up to but not including the translation initiation ATG, followed by 15 bases of natural 5'-untranslated sequence of the mouse immunoglobulin light-chain gene. 

I claim:
 1. Plasmids pCMGS, pEE6.hCMV and pHT.1.
 2. A recombinant expression vector comprising the promoter, enhancer and complete 5' untranslated region including the first intron of the hCMV-MIE gene operably linked to a heterologous coding sequence.
 3. The recombinant expression vector according to claim 2 further comprising a restriction site to facilitate insertion of the heterologous coding sequence.
 4. The recombinant expression vector according to claim 2 wherein the promoter, enhancer and complete 5' untranslated region including the first intron of the hCMV-MIE gene are linked directly to the heterologous coding sequence.
 5. The recombinant expression vector according to claim 2 wherein the vector further includes the translation initiation signal of the hCMV MIE gene translational initiation signal.
 6. The recombinant expression vector according to claim 5 wherein the translational initiation signal includes the sequence 5'-GTCACCGTCCTTGACACCATG-3'.
 7. The recombinant expression vector according to claim 5 wherein the translational initiation signal includes the sequence 5'-CCATGG-3'. 